Ambiguities in recurrence-based complex network representations of time series 
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Recently, different approaches have been proposed for studying basic properties of time series 
from a complex network perspective. In this work, the corresponding potentials and limitations of 
networks based on recurrences in phase space are investigated in some detail. We discuss the main 
requirements that permit a feasible system-theoretic interpretation of network topology in terms 
of dynamically invariant phase space properties. Possible artifacts induced by disregarding these 
requirements are pointed out and systematically studied. Finally, a rigorous interpretation of the 
clustering coefficient and the betweenness centrality in terms of invariant objects is proposed. 
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During the last decade, increasing interest has arisen in 
structural and dynamical properties of complex networks 
[l|. Particular efforts have been spent on reconstruct- 
ing network topologies from experimental data, e.g., in 
ecology 0, social systems 0], neuroscience or at- 
mospheric dynamics The latter example yields a 
complex network representation of a continuous system, 
which suggests that applying a similar spatial discretiza- 
tion to the phase space of dynamical systems and using 
complex network methods as a novel tool for time series 
analysis could be feasible, too 0, For this purpose, 
different methods have been proposed (for a comparative 
review, see Q) and successfully applied to real- world as 
well as model systems. 

Many existing methods for transforming time series 
into complex network representations have in common 
that they define the connectivity of a complex network 
- similar to the spatio-temporal case - by the mutual 
proximity of different parts (e.g., individual states, state 
vectors, or cycles) of a single trajectory. In this work, 
we particularly consider recurrence networks, which are 
based on the concept of recurrences in phase space [1, 
[loj and provide a generic way for analyzing phase s pac e 
properties in terms of network topology 0, d, [TTI. Il2l|. 
Here, the basic idea is to interpret the recurrence matrix 
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associated with a dynamical system's trajectory, i.e., a 
binary matrix that encodes whether or not the phase 
space distance between two observed "states" x.^ and Xj 
is smaller than a certain recurrence threshold e, as the ad- 
jacency matrix of an undirected complex network. Since 
a single finite-time trajectory may however not necessar- 
ily represent the typical long-term behavior of the un- 
derlying system, the resulting network properties may 



depend ~ among others - on the length N of the con- 
sidered time series (i.e., the network size), the probabil- 
ity distribution of the data, embedding, sampling, etc. 
In the following, we present a critical discussion of the 
basic requirements for the application of recurrence net- 
works and show that their insufficient application leads to 
pitfalls in the system-theoretic interpretation of complex 
network measures. 

Threshold selection. The crucial algorithmic param- 
eter of recurrence-based time series analysis is e. Sev- 
eral invariants of a dynamical system (e.g., the 2nd-order 
Renyi entropy K2) can be estimated by taking its recur- 
rence properties for e — >• , which suggests that for a 
feasible analysis of recurrence networks, a low e is prefer- 
able as well. This is supported by the analogy to complex 
networks based on spatially extended systems, where at- 
tention is usually restricted to the strongest links between 
individual vertices (i.e., observations from different spa- 
tial coordinates) for retrieving meaningful information 
about relevant aspects of the systems' dynamics d, [![. 
In contrast, a high edge density 
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(with E^e) being the total number of edges for a chosen 
e) does not yield feasible information about the actually 
relevant structures, because these arc hidden in a large 
set of mainly less important edges. 

As a consequence, only those states should be con- 
nected in a recurrence network that are closely neigh- 
bored in phase space, leading to rather sparse networks. 
Following a corresponding rule of thumb recently con- 
firmed for recurrence quantification analysis [l^l , we sug- 
gest choosing e as corresponding to an edge density 
P < 0.05 Hi], which yields neighborhoods covering 
appropriately small regions of phase space. Note that 
since many topological features of recurrence networks 
are closely related to the local phase space properties of 
the underlying attractor 0, the corresponding informa- 
tion is best preserved for such low e unless the presence 
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FIG. 1: (Color online) Effects of difTerent metrics and embeddings on the p(e) relationship, expressed in terms of the correspond- 
ing first derivative. (A,B,C,D): Manhattan distance; (E,F,G,H): Euclidean distance; (I,J,K,L): maximum distance. (A,E,I): 
Lorenz system (i — 10{y — x), y = x{28 — z), z = xy — |z) with original components at three different randomly chosen initial 
conditions. (B,F,J): Same Lorenz system embedded from the x component with embedding delays ri — 5,T2 = 15, and rs — 20. 
(C,G,K): same as (A,E,I) for the Rossler system (i = —y — z, y = x + 0.2y, z = z{x — 5.7)). (D,H,L): same as (B,F,J) for the 
Rossler system with n = 10, T2 = 15 and rs = 20. Circles indicate the respective maxima. In all cases, time series of A'^ = 1, 000 
points with a sampling time of At = 0.05 have been used, obtained with a 4th-order Runge-Kutta integrator with fixed step 
width h = 0.01. The values of T2 are guided by the first zeros of the corresponding auto-correlation functions. 



of noise requires higher e [l3[. 

Recently, a heuristic criterion has been proposed by 
Gao and Jin, which selects e as the (supposedly unique) 
turning point Scrit in the p(e) relationship of certain dy- 
namical systems (l2j . formally reading 
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In contrast to our above considerations, for different real- 
izations of the Lorenz system, this turning point criterion 
yields link densities of Pcrit = pi^ = ^crit) ^ 0.15 ... 0.3 
p^ . implying that considerably large regions of the at- 
tractor are covered by the corresponding neighborhoods. 
In such cases, it is however not possible to attribute 
certain network features to specific small-scale attractor 
properties in phase space. More generally, e should be 
chosen in such a way that small variations in e do not 
induce large variations in the results of the analysis. In 
contrast, the turning point criterion ^ explicitly selects 
e such that small perturbations in its value will result in 
a maximum variation of the results. Moreover, besides 
our general considerations supporting low e, application 
of the turning point criterion leads to serious pitfalls: 
(i) Scrit and, hence, Pcrit depend on the specific met- 



ric used for defining distances in phase space (Fig. [T|). 
Moreover, experimental time series often contain only a 
single scalar variable, so that embedding might be neces- 
sary. Since the detailed shape of the attractor in phase 
space is affected by the embedding parameters, chang- 
ing the embedding delay has a substantial effect on Scrit, 
which is particularly visible in the Rossler system (see 
Fig. [1] (D,H,L)). An improper choice of embedding pa- 
rameters would further increase the variance of Ecrit and 
will generally not yield meaningful results. In a similar 
way, depending on the choice of the other parameters 
the sampling time of the time series may also influence 
the recurrence properties [l^ (and, hence, Scrit), since 
temporal coarse-graining can cause a loss of detections 
of recurrences. 

(ii) The e-sclcction should be as independent as pos- 
sible of the particular realization of the studied system, 
especially from the initial conditions and the length N of 
the time series. The turning point Scrit after conditions 
([5]) is however not independent of the specific initial con- 
ditions (Fig. [T] (A,E,I) and (C,G,K)): while its average 
value docs not change much with changing iV, there is 
a large variance among the individual trajectories that 
converges only slowly with increasing N (Fig. [2]). Hence, 
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FIG. 2; Mean values (squares) and range (shaded areas) of 
turning points Scru in 200 independent realizations of the 
Lorenz (A) and the Rossler system (B) in dependence on the 
network size A'^ (Euclidean distance, At = 0.05). The insets 
show the corresponding link densities pcrit for the same range 
of A. 
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FIG. 3: Examples for multiple turning points of dp/de: (A) 
quasiperiodic trajectory of a continuous system (torus) and 
(B) a weakly chaotic orbit of the standard map {xn+i = Xn + 
5sin(y„) mod 1, t/n+i = t/n + Xn+i mod 1, see [l5l]l. 



for the same system and the same network size, already 
slightly different initial conditions may yield strong dif- 
ferences in Ecrit and Pcrit (Fig. El inset) and, hence, the 
topological features of the resulting networks. 

(iii) One has to emphasize that the turning point cri- 
terion is not generally applicable, since there are various 
typical examples for both discrete and continuous dy- 
namical systems that are characterized by several max- 
ima of dp{e)/de (Fig. El). 

The above considerations are mainly of concern when 
studying properties of (known) dynamical systems. In 
applications to real-world time series with typically a 
small number of data or even non-stationarities, it is 
still possible to derive meaningful qualitative results from 
small time series networks. However, for a detailed 
system-theoretic interpretation the use of smaller recur- 
rence thresholds is recommended Q. 

Topology of recurrence networks. The topological fea- 
tures of recurrence networks are closely related to in- 
varian t p roperties of the observed dynamical system 
0, H, However, a system-theoretic interpretation 

of the resulting network characteristics is feasible only 
based on a careful choice of e, avoiding the pitfalls out- 
lined above. For example, many paradigmatic network 
models as well as real- world systems have been reported 
to possess small- world properties (i.e., a high clustering 
coefficient C and low average path length £). However, 
it can be shown that C and C are both functions of e. 



In particular, C ^ 1/e (for given A''), since spatial dis- 
tances are approximately conserved in recurrence net- 
works, whereas the specific e-dependence of C varies be- 
tween different systems. 

In addition to the aforementioned global network char- 
acteristics, specific vertex properties characterize the lo- 
cal attractor geometry in phase space in some more de- 
tail, where the spatial resolution is determined by e. In 
particular, the local clustering coefficient Cy, which quan- 
tifies the relative amount of triangles centered at a given 
vertex v, gives important information about the geomet- 
ric structure of the attractor within the e-neighborhood 
of V in phase space. Specifically, if the neighboring states 
form a lower-dimensional subset than the attractor, it is 
more likely that closed triangles emerge than for a neigh- 
borhood being more uniformly filled with states jig . 
Hence, high values of Cu indicate lower-dimensional struc- 
tures that may correspond to laminar regimes or dy- 
namically invariant objects like unstable periodic orbits 
(UPOs) The relationship with UPOs follows from 
the fact that trajectories tend to stay in the vicinity of 
such orbits for a finite time [l7j , which leads to a certain 
amount of states being accumulated along the UPO with 
a distinct spatial geometry that differs from that in other 
parts of a chaotic attractor. However, since there are in- 
finitely many UPOs embedded in chaotic attractors, such 
objects (even of a low order) can hardly be detected using 
large e (where the resulting neighborhoods cover differ- 
ent UPOs) and short time series as recently suggested 
[l^ . In contrast, theymay be well identified using low e 
and long time series @. 

Another intensively studied vertex property is be- 
twcenncss centrality by, which quantifies the relative 
number of shortest paths in a network that include a 
given vertex w In a recurrence network, vertices 

with high by correspond to regions with low phase space 
density that are located between higher density regions. 
Hence, by yields information about the local fragmen- 
tation of an attractor. In particular, since phase space 
regions close to the outer boundaries of the correspond- 
ing attractors do not contribute to many shortest paths, 
the vertices located in these regions are characterized by 
low by, which is (at least for the Lorenz oscillator) even 
enhanced by a lower state density. For the sharp inner 
boundary of the Rossler oscillator, one may observe the 
opposite behavior. For phase space regions close to low- 
pcriod UPOs, one also finds lower values of by due to the 
accumulation of states along these structures (many al- 
ternative paths). As the distribution of 6^ (Fig.|31(A,B)) 
suggests, these features are robust for low e, but may 
significantly change if e gets too large (i.e., p ~ 0.2). 

We conclude that in a recurrence network, both Cy and 
by are sensitive to the presence of UPOs, but resolve com- 
plementary aspects (see Fig. HI). For the Rossler system, 
we find two distinct maxima in the betweenness distri- 
bution, which are related to the inner and outer parts 
of the attractor, respectively. In particular, the abun- 
dance of low values is promoted by a high state den- 
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FIG. 4: (Color online) Probability distribution function of 
betweenness centrality &„ (in logarithmic scale) for different 
edge densities (pi = 0.005, p2 = 0.01, pa = 0.015, p4 = 0.2) 
for the Lorenz (A, A'^ = 20,000) and Rossler system (B, 
A*' — 10,000), and corresponding relationships between lo- 
cal clustering coefficient C„ and betweenness centrality b„ (C: 
Lorenz, D: Rossler, p — 0.01) obtained from the original data 
using the Euclidean distance (At = 0.05). 

sity at the outer boundary of the attractor near the x- 
y plane, which coincides with a period-3 UPO [l^. In 
contrast, for the Lorenz system there is no second max- 
imum of p{by), since the outer parts of the attractor are 
more diffuse and characterized by a considerably lower 
phase space density than in the Rossler attractor. In 
both cases, vertices with a high clustering coefHcient Cy 
are characterized by a broad continuum of betweenness 
values, which suggests that by is no universal indicator 
for the presence of UPOs, whereas Cy allows an approx- 
imate detection of at least low-periodic UPOs in phase 
space. 

In summary, transforming time series into complex 
networks yields complementary measures for character- 
izing phase space properties of dynamical systems. This 
work has provided empirical arguments that the recently 
suggested approach based on the recurrence properties 
in phase space allows a detailed characterization of dy- 



namically relevant aspects of phase space properties of 
the attractor, given that (i) the considered time series 
is long enough to be representative for the system's dy- 
namics and (ii) the threshold distance e in phase space 
for defining a recurrence is chosen small enough to resolve 
the scales of interest. In particular, using the network- 
theoretic measures discussed here, the turning point cri- 
terion for threshold selection [l^] often docs not allow fea- 
sible conclusions about dynamically relevant structures 
in phase space. In contrast, for sufficiently low recur- 
rence thresholds (we suggest p < 0.05 as a rule of thumb), 
small-scale structure may be resolved appropriately by 
complex network measures, which allow identification 
of invariant objects such as UPOs by purely geometric 
means. We emphasize that although our presented 
considerations have been restricted to paradigmatic ex- 
ample systems, recurrence networks and related meth- 
ods have already been successfully applied to real-world 
data, e.g., a paleoclimate record Q or seismic activ- 
ity [19| . Since these examples are typically characterized 
by non-stationarities and non-deterministic components, 
we conclude that recurrence networks are promising for 
future applied research on various interdisciplinary prob- 
lems (e.g., lU). 

As a final remark, we note that the problem of parame- 
ter selection arises for most other network-based methods 
of time series analysis (see Q for a detailed comparison) . 
Important examples include cycle networks with a cor- 
relation threshold, and fc-nearest neighbor networks [T]| 
with a fixed number k of neighbors as free parameters, 
respectively. A general framework for parameter selec- 
tion in the context considered here would consequently 
be desirable. Other methods are parameter-free, but may 
suffer from conceptual limitations and strong intrinsic as- 
sumptions. For example, the currently available visibility 
graph concepts [l^l are restricted to univariate time se- 
ries. 
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